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Abstract 


This document describes the changes between Unicode 6.0.0 and Unicode 12.0.0 in the context of 
the current version of Internationalized Domain Names for Applications 2008 (IDNA2008). Some 
additions and changes have been made in the Unicode Standard that affect the values produced 
by the algorithm IDNA2008 specifies. IDNA2008 allows adding exceptions to the algorithm for 
backward compatibility; however, this document does not add any such exceptions. This 
document provides the necessary tables to IANA to make its database consistent with Unicode 
12.0.0. 


To improve understanding, this document describes systems that are being used as alternatives 
to those that conform to IDNA2008. 
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Standards is available in Section 2 of RFC 7841. 
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1. Introduction 


The current version of Internationalized Domain Names for Applications (IDNA) was initiated in 
2008, and despite not being completed until 2010, is widely known as "IDNA2008". It is specified in 
the series of documents listed in Section 2.1. The IDNA2008 standard includes an algorithm by 
which a derived property value is calculated based on the properties defined in the Unicode 
Standard. 


The derived property values that can be calculated are defined in RFC 5892 [RFC5892]. Below is a 
summary to aid in the reading of this document. For definition of the terms, please see RFC 5892 
[RFC5892]. 


PROTOCOL VALID: Those that are allowed to be used in IDNs. Code points with this property 
value are permitted for general use in IDNs. However, the fact that a label consists only of 
code points with this property value does not imply that the label can be used in DNS. The 
abbreviated term PVALID is used to refer to this value. 


CONTEXTUAL RULE REQUIRED: Some characteristics of the character, such as it being invisible 
in certain contexts or problematic in others, require that it not be used in labels unless specific 
other characters or properties are present. The abbreviated term CONTEXT is used to refer to 
this value. As explained in RFC 5892 [RFC5892], CONTEXT is in turn divided into CONTEXTJ and 
CONTEXTO. 


DISALLOWED: Those that should clearly not be included in IDNs. Code points with this property 
value are not permitted in IDNs. 


UNASSIGNED: Those code points that are not designated (1.е., are unassigned) in the Unicode 
Standard. 


When the Unicode Standard is updated, new code points are assigned and already assigned code 
points can have their property values changed. 


* Assigning code points can create problems if the newly assigned code points are 
compositions of existing code points and the normalization relationships associated with 
those code points should have been changed because of that. 

* Changing properties for already assigned code points can create problems if the property 
change results in changes to the derived property value. A previously allowed code point 
whose derived property value is PVALID may now be prohibited if its derived property value 
changes to DISALLOWED. The problem can also happen the other way around: a code point 
that was not allowed (and thus was prohibited) can suddenly be allowed. 
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* Problems can also be created if the properties assigned to those code points are inconsistent 
with IDNA2008 assumptions about how properties are assigned and/or about how code points 
with those properties are used or behave. 


There were three incompatible changes in the Unicode Standard between Unicode 5.2.0 
[Unicode-5.2.0] and Unicode 6.0.0 [Unicode-6.0.0]; they are described in RFC 6452 [RFC6452]. The 
code points U+0CF1 and U+0CF2 had a derived property value change from DISALLOWED to 
PVALID, and the code point U+19DA had a change in derived property value from PVALID to 
DISALLOWED. These changes where examined in great detail, but the IETF concluded that these 
changes to the Unicode Standard did not warrant an update to RFC 5892 [RFC5892]. 


As described in Section 3, more incompatible changes have been made to code points between 
Unicode 6.0.0 and Unicode 12.0.0 [Unicode-12.0.0]; however, the changes in the derived property 
values do not result in exceptions (as defined in Section 2.6 of RFC 5892 [RFC5892]) that would 
require an update to the "IDNA Contextual Rules" registry (which would also be considered an 
update to RFC 5892 [RFC5892]). 


Further, in 2015, the Internet Architecture Board (IAB) issued a statement [IAB2005-1] that advised 
the community to avoid using any of the potentially problematic code points and asked the IETF 
to resolve the issues related to the code point ARABIC LETTER BEH WITH HAMZA ABOVE 
(U+08A1) that was introduced in Unicode 7.0.0 [Unicode-7.0.0]. In February of that year, the 
statement was revised [IAB2005-2] to focus on the latter request. More details about the problem 
of code point sequences not normalizing as one might expect appear in a draft that was part of 
the discussion [IDNAT]. 


The result of the work in the IETF was that no exception was added to RFC 5892 [RFC5892]; 
however, it should be noted that the review of the issues around U+08A1 indicated that this code 
point is not an isolated case and that a number of long-standing PVALID code points may have 
similar issues. While the affected code points remain PVALID in this document, identification of 
the problem resulted in a clarification of the review process for new Unicode versions. That 
clarification, which reinforces the original review plan to capture issues like these, was published 
as RFC 8753 [RFC8753]. Any review of Unicode versions after 12.0.0 should be made according to 
RFC 8753 [RFC8753]; an objective of this document is to ensure that a proper review of such 
versions after version 12.0.0 can be made. 


2. Background 


2.1. IDNA2008 Documents 


IDNA2008 consists of the following documents. The documents in the set have informal names. 


• "Internationalized Domain Names for Applications (IDNA): Definitions and Document 
Framework" [RFC5890], informally called "Defs" or "Definitions", contains definitions and 
other material that are needed for understanding other documents in the set. 

• "Internationalized Domain Names in Applications (IDNA): Protocol" [RFC5891], informally 
called "Protocol", describes the core IDNA2008 protocol and its operations. It needs to be 
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interpreted in combination with the Bidi document (described below). RFC 5891 [RFC5891] 
obsoletes RFC 3491 [RFC3491] and, in particular, the use of the tables to which RFC 3491 
[RFC3491] refers. 


• "The Unicode Code Points and Internationalized Domain Names for Applications (IDNA)" 
[RFC5892], informally called "Tables", lists the categories and rules that identify the code 
points allowed in a label written in native character form (called a "U-label"), and is based on 
Unicode 5.2.0 [Unicode-5.2.0] code point assignments and additional rules unique to 
IDNA2008. The Unicode-based rules in RFC 5892 are expected to be stable across Unicode 
updates and hence independent of Unicode versions. 


* "Right-to-Left Scripts for Internationalized Domain Names for Applications (IDNA)" 
[RFC5893], informally called "Bidi", specifies special rules for labels that contain characters 
that are written from right to left. 


• "Internationalized Domain Names for Applications (IDNA): Background, Explanation, and 
Rationale" [RFC5894], informally called "Rationale", provides an overview of the protocol and 
associated tables, and gives explanatory material and some rationale for the decisions that 
led to IDNA2008. It also contains advice for DNS registry operators and others who use 
Internationalized Domain Names (IDNs). 


* "Mapping Characters for Internationalized Domain Names in Applications (IDNA) 2008" 
[RFC5895], informally called "Mapping", discusses the issue of mapping characters into other 
characters and provides guidance for doing so when that is appropriate. RFC 5895 provides 
advice only and is not a required part of IDNA. 


2.2. Additional Important IDNA2008-Related Documents 


There are other documents important for the understanding and functioning of IDNA2008, for 
example this. 


* "The Unicode Code Points and Internationalized Domain Names for Applications (IDNA) - 
Unicode 6.0" [RFC6452] describes some changes made to Unicode 6.0.0 [Unicode-6.0.0] that 
resulted in derived property value changes for the code points U+0CF1, U+0CF2, and U+19DA. 
U+0CF1 and U+0CF2 changed from DISALLOWED to PVALID, while U*19DA changed from 
PVALID to DISALLOWED. The IETF concluded that no update to RFC 5892 [RFC5892] was 
needed based on the changes made in Unicode 6.0.0 [Unicode-6.0.0]. As a result, the derived 
property value remained aligned with the Unicode Standard. Specifically, no exception was 
added. 


2.3. Deployment 


There are many variations on the general IDNA model in use in the various parts of the 
community. The following lists some of the strategies that implementations that claim to be IDNA 
compliant are known to use, but it should be noted the list is not complete: 


* IDNA2003 as specified in RFC 3490 [RFC3490] and RFC 3491 [RFC3491]. Those specifications are 
dependent on case folding, Normalization Form KC (NFKC), and on tables that specify for 
each code point whether it is allowed to be used or not, with a distinction made between use 
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for "stored strings" and "query strings". The tables themselves are dependent on Unicode 3.2 
[Unicode-3.2.0]. 


e A number of variations on IDNA2003, sometimes presented as "updated IDNA2003" or the like, 
which follow the principles of IDNA2003 as understood by the implementers but that use 
tables that represent how the implementers believe Stringprep [RFC3454] and Nameprep 
[RFC3491] would have evolved had the IETF not moved in the direction of IDNA2008 instead. 


*Amix between IDNA2003 and IDNA2008 where code points assigned to Unicode after Unicode 
3.2.0 [Unicode-3.2.0] have derived property value calculated according to the algorithm 
specified in IDNA2008. 


*Amix between IDNA2003 and IDNA2008 according to the Unicode Technical Standard #46 
[UTS-46]. Because that document specifies different profiles, there are several variations that 
leave users with no guarantee that two applications claiming conformance to UTS#46 will 
interoperate well with each other much less with conforming IDNA2008 implementations. 
UTS#46 is ultimately based on a normative table very much like the one used by Stringprep 
[RFC3454] but updated for each new version of Unicode. 


* The (normative) IDNA2008 algorithm applied to whatever version of Unicode Standard exists 
in the operating system and/or libraries used, independent of whatever version of tables 
appears in the (non-normative) IANA database. 


In practice, the Unicode Consortium creates a maximum set of code points by assigning code 
points in the Unicode Standard. The IDNA2008 rules use the Unicode Standard to create a further 
subset of code points and context that are permitted in DNS labels associated with its PVALID and 
CONTEXT (CONTEXTJ or CONTEXTO) derived property values. DNS registries and other 
organizations that deal with IDNs are supposed to create their own subsets from IDNA2008 for use 
by those registries and organizations. 


This progressive subsetting and narrowing of the repertoire of code points that can be used in 
labels is an implementation of the principles of being conservative when deciding what code 
points to include in such a subset. SAC-084 [SAC-084] and RFC 6912 [RFC6912] recommend to DNS 
registries and other organizations to be conservative when creating their subsets and to use the 
principle of creating subsets by inclusion. 


See also Security Considerations (Section 7) in this document. 


3. Notable Changes between Unicode 6.0.0 and 12.0.0 


Among the changes between the Unicode versions, most code points that change derived 
property value change from UNASSIGNED to PVALID or from UNASSIGNED to DISALLOWED. The 
interesting changes in derived property values include other changes. All changes between the 
major versions of Unicode can be found in Appendix A (6.0.0-7.0.0), Appendix B (7.0.0-8.0.0), 
Appendix C (8.0.0-9.0.0), Appendix D (9.0.0-10.0.0), Appendix E (10.0.0-11.0.0), and Appendix F 
(11.0.0-12.0.0). 
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3.1. Changes between Unicode 6.0.0 and 7.0.0 


Change in number of characters in each category: 


* PVALID changed from 97418 to 99867 (+2449) 

* UNASSIGNED changed from 865081 to 861509 (-3572) 
* CONTEXT] did not change, at 2 

* CONTEXTO did not change, at 25 

* DISALLOWED changed from 151586 to 152709 (+1123) 
* TOTAL did not change, at 1114112 


There are no changes made to Unicode between version 6.0.0 and 7.0.0 that impact IDNA2008 
calculation of the derived property values. 


The code points U+17B4 KHMER VOWEL INHERENT AQ and U+17B5 KHMER VOWEL INHERENT 
AA both changed the General Category from Cf (Format) to Mn (Nonspacing Mark), but that did 
not impact the calculation of the derived property value which stayed at DISALLOWED. 


The character ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) was introduced in Unicode 
7.0.0. This was discussed extensively in the IETF and also by the IAB in their statement [IAB2005-1] 
requesting the IETF to investigate the issue. Specifically, the IAB stated: 


On the same precautionary principle, the IAB recommends that the Internationalized 
Domain Names for Applications (IDNA) Parameters registry <https://www.iana.org/ 
assignments/idna-tables/> not be updated to Unicode 7.0.0 until the IETF has consensus 
on a solution to this problem. 


The discussion in the IETF concluded that although it is possible to create "the same" character in 
multiple ways, the issue with U+08A1 is not unique. The character U+08A1 (ARABIC LETTER BEH 
WITH HAMZA ABOVE) can be represented with the sequence ARABIC LETTER BEH (U+0628) and 
ARABIC HAMZA ABOVE (U+0654). This is identical to LATIN SMALL LETTER O WITH STROKE 
(U+00F8), which can be represented with the sequence LATIN SMALL LETTER O (U+006F) followed 
by COMBINING SHORT SOLIDUS OVERLAY (U+0337). 


Although the discussion about this specific code point resulted in acceptance of the derived 
property value of PVALID, the underlying problem with combining sequences is not understood 
fully. Therefore, it cannot be claimed that this case can be extrapolated to other situations and 
other code points. 


3.2. Changes between Unicode 7.0.0 and 10.0.0 


Change in number of characters in each category: 


* Code points that changed derived property value: 0 
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* PVALID changed from 99867 to 122411 (+22544) 

* UNASSIGNED changed from 861509 to 837775 (-23734) 
* CONTEXT] did not change, at 2 

* CONTEXTO did not change, at 25 

* DISALLOWED changed from 152709 to 153899 (+1190) 
* TOTAL did not change, at 1114112 


There are no changes made to Unicode between version 7.0.0 and 10.0.0 that impact IDNA2008 
calculation of the derived property values. 


3.3. Changes between Unicode 10.0.0 and 11.0.0 


Change in number of characters in each category: 


* Code points that changed derived property value: 1 

* PVALID changed from 122411 to 122734 (*323) 

* UNASSIGNED changed from 837775 to 837091 (-684) 

* CONTEXT] did not change, at 2 

* CONTEXTO did not change, at 25 

* DISALLOWED changed from 153899 to 154260 (+361) 

* TOTAL did not change, at 1114112 

* Georgian letters in the ranges U+10D0..U+10FA and U+10FD..U+10FF had their General 
Category changed from Lo (Other Letter) to Ll (Lowercase Letter) to reflect their status as the 
lowercase of new Georgian case pairs. Case mappings were also added. 

* SHARADA SANDHI MARK (U+111C9) General Category was changed from Po 
(Other Punctuation) to Mn (Nonspacing Mark), and the Bidi property was changed from L 
(Left to Right) to NSM (Nonspacing Mark). 

* The properties for ZANABAZAR SQUARE VOWEL SIGN AI (U*11A07) and ZANABAZAR SQUARE 
VOWEL SIGN AU (U+11A08) were corrected from Mc to Mn. 


* SPHERICAL ANGLE OPENING UP (U+29A1) was changed to Bidi Mirrored to No. 
These changes to the Unicode Standard have the following implications for these code points: 


* The newly assigned 684 characters are assigned a derived property value as of a result of 
applying the IDNA2008 algorithm. 

* The Georgian letters in the ranges U+10D0..U+10FA and U+10FD..U+10FF existed before 
IDNA2008 was created. Applying the IDNA2008 algorithm to the code points assigned the 
derived property value PVALID, and that value is unchanged even if the underlying Unicode 
properties have changed. The newly encoded Mtavruli letters have General Category Lu 
(Uppercase Letter) and are therefore DISALLOWED. 

* The 0+111С9 SHARADA SANDHI MARK was added to Unicode 8.0.0 [Unicode-8.0.0]. Applying 
the IDNA2008 algorithm to the code point assigned the derived property value DISALLOWED. 
The changes in the underlying properties in Unicode 11.0.0 [Unicode-11.0.0] caused the 
derived property value to change to PVALID. 
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* The characters ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and ZANABAZAR SQUARE 
VOWEL SIGN AU (U+11A08) were added to Unicode 10.0.0 [Unicode-10.0.0]. Applying the 
IDNA2008 algorithm to the code points assigned the derived property value PVALID, and that 
value is unchanged even if the underlying Unicode properties have changed. 

* SPHERICAL ANGLE OPENING UP (U+29A1) existed before IDNA2008 was created. Applying the 
IDNA2008 algorithm to the code point assigned the derived property value DISALLOWED, and 
that value is unchanged even if the underlying Unicode properties have changed. 


3.4. Changes between Unicode 11.0.0 and 12.0.0 


Change in number of characters in each category: 


* Code points that changed derived property value: 0 
* PVALID changed from 122734 to 123006 (+272) 

* UNASSIGNED changed from 837091 to 836537 (-554) 
* CONTEXT] did not change, at 2 

* CONTEXTO did not change, at 25 

* DISALLOWED changed from 154260 to 154542 (+282) 
* TOTAL did not change, at 1114112 


4. U*111C9 SHARADA SANDHI MARK 


As one can see in Section 3, an incompatible property change was made between Unicode 6.0.0 
and 12.0.0, affecting the code point U*111C9. Its derived property value thus changed from 
DISALLOWED to PVALID. In situations like these, IDNA2008 allows for addition of rules to RFC 
5892 [RFC5892], Section 2.7. If the code point is accepted, it might still be rejected if validated by 
software based on versions of Unicode older than 12.0.0. As the character is rarely used outside 
the group of Sharada specialists but is used in some records for indicating sandhi breaks, the 
conclusion was that it could either be added as an exception or allowed to change its property 
value. As including an exception would require implementation changes to deployments of 
IDNA20008, the IETF has decided not to add a BackwardCompatible rule to IDNA2008 (i.e., Section 
2.7 of RFC 5892 [RFC5892]) for this code point. This also ensures all sandhi marks are treated 
equally. 


5. Conclusion 


As described in Sections 3 and 4, changes have been made to Unicode between version 6.0.0 and 
12.0.0. Some changes to specific characters changed their derived property value, whereas other 
changes did not. Given the deployment considerations described in Section 2.3 and changes in the 
Unicode Standard described in Sections 3 and 4, including implications to normalization, the 
conclusion is not to add any exception rules to IDNA2008. 


This document addresses only changes to Unicode between version 6.0.0 and version 12.0.0. 
Changes in future Unicode versions might result in the conclusion that exception rules need to be 
added to IDNA2008 after the review process explained in RFC 8753 [RFC8753]. Separately from any 
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changes in Unicode, the IETF might conclude that updates to RFC 5892 [RFC5892] or other 
IDNA2008 documents might become necessary; such updates might include changes to the 
algorithm specified in IDNA2008 as well as additional rules, categories, or other forms of tuning, 
like the clarifications in RFC 8753 [RFC8753]. 


6. IANA Considerations 


IANA updated the "IDNA Rules and Derived Property Values" [[ANA-IDNA] registry after the expert 
reviewer validated that the derived property values were calculated correctly. 


7. Security Considerations 


This document makes recommendations regarding the use of the IDNA2008 algorithm for 
calculation of derived property values, based on Unicode version 12.0.0. This recommendation 
does not say anything about what recommendations to make for future versions of the Unicode 
Standard. 


Not following these recommendations can lead to various security issues. Specifically, allowing 
confusable characters may lead to various phishing attacks, as described in the Security 
Consideration Sections in the documents listed in Section 2.1. 
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037F 
0528 
0529 
052A 
052В 
052С 
0520 
052Е 
052Ғ 


058D. 
0604. 


061C 


08А0. 
08Е4. 


0978 
0980 
OAF0 
0сөө 
0C34 
0C81 
0001 


ODE6. 
OEDE. 


10C7 
10CD 


10FD. 
16F1. 
17B4. 
191D. 
1ABO. 


1ABE 


1BAB. 
1BBA. 
1СС0. 
1CF3. 
1CF8. 
1DE7. 
2066. 
20BA. 
23F4. 


2700 
27CB 
27CD 


2B4D. 
2B5A. 
2B76. 
2B98. 
2BBD. 
2BCA. 


2CF2 
2CF3 
2D27 
2D2D 


2D66. 
2E327 


9FCC 


A674. 
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.058F 
.0605 


.08B2 
.08FF 


DEF 
. BEDF 


.10FF 
.16F8 
.17B5 
ТӨЛЕ 
. TABD 


. 1BAD 
. 1BBF 
.1СС7 
.1СҒ6 
.1СҒ9 
HIDES 
.2069 
.20В0 
.23FA 


.2В4Ғ 
.2В73 
.2В95 
.2ВВ9 
.2ВС8 
.2В01 


.2067 
.2Е42 


.А67В 


DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
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GREEK CAPITAL LETTER YOT 

CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK 
CYRILLIC SMALL LETTER EN WITH LEFT HOOK 
CYRILLIC CAPITAL LETTER DZZHE 

CYRILLIC SMALL LETTER DZZHE 

CYRILLIC CAPITAL LETTER DCHE 

CYRILLIC SMALL LETTER DCHE 

CYRILLIC CAPITAL LETTER EL WITH DESCENDER 
CYRILLIC SMALL LETTER EL WITH DESCENDER 
RIGHT-FACING ARMENIAN ETERNITY SIGN. .ARMENIAN 
ARABIC SIGN SAMVAT..ARABIC NUMBER MARK ABOVE 
ARABIC LETTER MARK 

ARABIC LETTER ВЕН WITH SMALL V BELOW. . АКАВІС 
ARABIC CURLY FATHA..ARABIC MARK SIDEWAYS NOON 
DEVANAGARI LETTER MARWARI DDA 

BENGALI ANJI 

GUJARATI ABBREVIATION SIGN 

TELUGU SIGN COMBINING CANDRABINDU ABOVE 
TELUGU LETTER LLLA 

KANNADA SIGN CANDRABINDU 

MALAYALAM SIGN CANDRABINDU 

SINHALA LITH DIGIT ZERO..SINHALA LITH DIGIT N 
LAO LETTER KHMU GO..LAO LETTER KHMU NYO 
GEORGIAN CAPITAL LETTER YN 

GEORGIAN CAPITAL LETTER AEN 

GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL S 
RUNIC LETTER K..RUNIC LETTER FRANKS CASKET AE 
KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT 
LIMBU LETTER GYAN..LIMBU LETTER TRA 

COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBININ 
COMBINING PARENTHESES OVERLAY 

SUNDANESE SIGN VIRAMA..SUNDANESE CONSONANT SI 
SUNDANESE AVAGRAHA..SUNDANESE LETTER FINAL M 
SUNDANESE PUNCTUATION BINDU SURYA..SUNDANESE 
VEDIC SIGN ROTATED ARDHAVISARGA..VEDIC SIGN U 
VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING 
COMBINING LATIN SMALL LETTER ALPHA..COMBINING 
LEFT-TO-RIGHT ISOLATE..POP DIRECTIONAL ISOLAT 
TURKISH LIRA SIGN..RUBLE SIGN 

BLACK MEDIUM LEFT-POINTING TRIANGLE..BLACK CI 
BLACK SAFETY SCISSORS 

MATHEMATICAL RISING DIAGONAL 

MATHEMATICAL FALLING DIAGONAL 

DOWNWARDS TRIANGLE-HEADED ZIGZAG ARROW..SHORT 
SLANTED NORTH ARROW WITH HOOKED HEAD..DOWNWAR 
NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGH 
THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARR 
BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT-P 
TOP HALF BLACK CIRCLE..UNCERTAINTY SIGN 
COPTIC CAPITAL LETTER BOHAIRIC KHEI 

COPTIC SMALL LETTER BOHAIRIC KHEI 

GEORGIAN SMALL LETTER YN 

GEORGIAN SMALL LETTER AEN 

TIFINAGH LETTER YE..TIFINAGH LETTER YO 

TURNED COMMA..DOUBLE LOW-REVERSED-9 QUOTATION 
«CJK Ideograph> 

COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBI 
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A698 
A699 
A69A 
A69B 


A69C.. 


A69F 
A792 


A793.. 


A796 
A797 
A798 
A799 
A79A 
A79B 
A79C 
A79D 
A79E 
A79F 


А?АА.. 
А?В0.. 


A7F7 


A7F8.. 
А9Е0.. 
AA7C.. 
AAE®@.. 
AAF®@.. 
AAF2.. 
АВЗ0.. 
ABSB.. 
AB64.. 
Б.А Eee 
БЕ2 ЕЕ 
1018В.. 


101А0 
102Е0 


ПОЕТ 


1031F 


10350-- 
10500.. 
10530.. 


1056Ғ 


10600.. 
10740.. 
110760. 
10860.. 
10877.. 
10880.. 
108A7.. 
10980.. 
109BE.. 
10A80.. 
10A9D.. 
10АС0.. 


10AC8 


10АС9.. 
10АЕВ.. 
10В80.. 
10В99.. 


Faltstrom 


DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
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CYRILLIC 
CYRILLIC 
CYRILLIC 


CAPITAL LETTER DOUBLE O 

SMALL LETTER DOUBLE O 

CAPITAL LETTER CROSSED O 

CYRILLIC SMALL LETTER CROSSED 0 

MODIFIER LETTER CYRILLIC HARD SIGN. .MODIFIER 
COMBINING CYRILLIC LETTER IOTIFIED E 

LATIN CAPITAL LETTER C WITH BAR 

LATIN SMALL LETTER C WITH BAR..LATIN SMALL LE 
LATIN CAPITAL LETTER B WITH FLOURISH 

LATIN SMALL LETTER B WITH FLOURISH 

LATIN CAPITAL LETTER F WITH STROKE 

LATIN SMALL LETTER F WITH STROKE 

LATIN CAPITAL LETTER VOLAPUK AE 

LATIN SMALL LETTER VOLAPUK AE 

LATIN CAPITAL LETTER VOLAPUK OE 

LATIN SMALL LETTER VOLAPUK OE 

LATIN CAPITAL LETTER VOLAPUK UE 

LATIN SMALL LETTER VOLAPUK UE 

LATIN CAPITAL LETTER H WITH HOOK..LATIN CAPIT 
LATIN CAPITAL LETTER TURNED K..LATIN CAPITAL 
LATIN EPIGRAPHIC LETTER SIDEWAYS I 

MODIFIER LETTER CAPITAL Н WITH STROKE. .MODIFI 
MYANMAR LETTER SHAN GHA..MYANMAR LETTER TAI L 
MYANMAR SIGN TAI LAING TONE-2..MYANMAR LETTER 
MEETEI MAYEK LETTER E..MEETEI MAYEK VOWEL SIG 
MEETEI MAYEK CHEIKHAN..MEETEI MAYEK AHANG KHU 
MEETEI MAYEK ANJI..MEETEI MAYEK VIRAMA 

LATIN SMALL LETTER BARRED ALPHA..LATIN SMALL 
MODIFIER BREVE WITH INVERTED BREVE..MODIFIER 
LATIN SMALL LETTER INVERTED ALPHA..GREEK LETT 
CJK COMPATIBILITY IDEOGRAPH-FA2E..CJK COMPATI 
COMBINING LIGATURE LEFT HALF BELOW..COMBINING 
GREEK ONE QUARTER SIGN..GREEK SINUSOID SIGN 
GREEK SYMBOL TAU RHO 

COPTIC EPACT THOUSANDS MARK 

COPTIC EPACT DIGIT ONE..COPTIC EPACT NUMBER N 
OLD ITALIC LETTER ESS 

OLD PERMIC LETTER AN..COMBINING OLD PERMIC LE 
ELBASAN LETTER A..ELBASAN LETTER KHE 
CAUCASIAN ALBANIAN LETTER ALT..CAUCASIAN ALBA 
CAUCASIAN ALBANIAN CITATION MARK 

LINEAR A SIGN AB001..LINEAR A SIGN A664 
LINEAR A SIGN A701 A..LINEAR A SIGN A732 JE 
LINEAR A SIGN A800..LINEAR A SIGN A807 
PALMYRENE LETTER ALEPH..PALMYRENE LETTER TAW 
PALMYRENE LEFT-POINTING FLEURON..PALMYRENE NU 
NABATAEAN LETTER FINAL ALEPH..NABATAEAN LETTE 
NABATAEAN NUMBER ONE..NABATAEAN NUMBER ONE HU 
MEROITIC HIEROGLYPHIC LETTER A..MEROITIC CURS 
MEROITIC CURSIVE LOGOGRAM RMT..MEROITIC CURSI 
OLD NORTH ARABIAN LETTER HEH..OLD NORTH ARABI 
OLD NORTH ARABIAN NUMBER ONE..OLD NORTH ARABI 
MANICHAEAN LETTER ALEPH..MANICHAEAN LETTER WA 
MANICHAEAN SIGN UD 

MANICHAEAN LETTER ZAYIN..MANICHAEAN ABBREVIAT 
MANICHAEAN NUMBER ONE..MANICHAEAN PUNCTUATION 
PSALTER PAHLAVI LETTER ALEPH..PSALTER PAHLAVI 
PSALTER PAHLAVI SECTION MARK..PSALTER PAHLAVI 
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10ВА9.. 


1107Ғ 


11000.. 
MOE Or 
ПЕШ Өте 
186. 
11140. . 
ЛИБО 
11174. . 


11176 


ІШІ 51252 
ПАМ 5 


111CD 


111D0.. 
db EXIST 
1200 
121132. 
1 123821 
ШЕВ бе 
Til E. 
ШӘ = 
П. 
ПЗӨЕ. 
12719 
ПИЗА. 
1193251 
(13355 
13363 
11347.. 
1134B.. 


19:57 


1135D.. 
11366.. 
MESS AS) oe 
11480.. 


114C6 
114C7 


11400.. 
11:580. 
11588. . 
10502. 
11600.. 
11641.. 


11644 


11650.. 
11680.. 
116С0.. 
118А0.. 
118С0.. 
118EA.. 


118FF 


11АС0.. 
12:36 [5-7 
12463.. 


12474 


16A40.. 
16А60.. 
16A6E.. 
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DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 


IDNA2008 and Unicode 12 March 2022 


PSALTER PAHLAVI NUMBER ONE..PSALTER PAHLAVI N 
BRAHMI NUMBER JOINER 

SORA SOMPENG LETTER SAH..SORA SOMPENG LETTER 
SORA SOMPENG DIGIT ZERO..SORA SOMPENG DIGIT N 
CHAKMA SIGN CANDRABINDU..CHAKMA MAAYYAA 
CHAKMA DIGIT ZERO..CHAKMA DIGIT NINE 

CHAKMA SECTION MARK..CHAKMA QUESTION MARK 
MAHAJANI LETTER A..MAHAJANI SIGN NUKTA 
MAHAJANI ABBREVIATION SIGN..MAHAJANI SECTION 
MAHAJANI LIGATURE SHRI 

SHARADA SIGN CANDRABINDU..SHARADA OM 

SHARADA DANDA..SHARADA SEPARATOR 

SHARADA SUTRA MARK 

SHARADA DIGIT ZERO..SHARADA EKAM 

SINHALA ARCHAIC DIGIT ONE..SINHALA ARCHAIC NU 
KHOJKI LETTER A..KHOJKI LETTER JJA 

KHOJKI LETTER NYA..KHOJKI SIGN SHADDA 

KHOJKI DANDA..KHOJKI ABBREVIATION SIGN 
KHUDAWADI LETTER A..KHUDAWADI SIGN VIRAMA 
KHUDAWADI DIGIT ZERO..KHUDAWADI DIGIT NINE 
GRANTHA SIGN CANDRABINDU..GRANTHA SIGN VISARG 
GRANTHA LETTER A..GRANTHA LETTER VOCALIC L 
GRANTHA LETTER EE..GRANTHA LETTER AI 

GRANTHA LETTER 00. .СКАМТНА LETTER МА 

GRANTHA LETTER PA..GRANTHA LETTER RA 

GRANTHA LETTER LA..GRANTHA LETTER LLA 

GRANTHA LETTER VA..GRANTHA LETTER HA 

GRANTHA SIGN NUKTA..GRANTHA VOWEL SIGN VOCALI 
GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI 
GRANTHA VOWEL SIGN 00..GRANTHA SIGN VIRAMA 
GRANTHA AU LENGTH MARK 

GRANTHA SIGN PLUTA..GRANTHA VOWEL SIGN VOCALI 
COMBINING GRANTHA DIGIT ZERO..COMBINING GRANT 
COMBINING GRANTHA LETTER A..COMBINING GRANTHA 
TIRHUTA ANJI..TIRHUTA GVANG 

TIRHUTA ABBREVIATION SIGN 

TIRHUTA OM 

TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE 
SIDDHAM LETTER A..SIDDHAM VOWEL SIGN VOCALIC 
SIDDHAM VOWEL SIGN E..SIDDHAM SIGN NUKTA 
SIDDHAM SIGN SIDDHAM..SIDDHAM END OF TEXT MAR 
MODI LETTER A..MODI SIGN ARDHACANDRA 

MODI DANDA..MODI ABBREVIATION SIGN 

MODI SIGN HUVA 

MODI DIGIT ZERO..MODI DIGIT NINE 

TAKRI LETTER A..TAKRI SIGN NUKTA 

TAKRI DIGIT ZERO..TAKRI DIGIT NINE 

WARANG CITI CAPITAL LETTER NGAA..WARANG CITI 
WARANG CITI SMALL LETTER NGAA..WARANG CITI DI 
WARANG CITI NUMBER TEN..WARANG CITI NUMBER NI 
WARANG CITI OM 

PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL ST 
CUNEIFORM SIGN KAP ELAMITE..CUNEIFORM SIGN UM 
CUNEIFORM NUMERIC SIGN ONE QUARTER GUR..CUNEI 
CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON 
MRO LETTER TA..MRO LETTER TEK 

MRO DIGIT ZERO..MRO DIGIT NINE 

MRO DANDA..MRO DOUBLE DANDA 
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16А00. 
16AF0. 


16AF5 


16B00. 
16B37. 
16В40. 
16В44. 
16В50. 
16B5B. 
16B63. 
16B7D. 
16F00. 
16F50. 
16F8F. 
1ВС00. 
1ВС70. 
1ВС80. 
1ВС90. 


1ВС9С 


1BC9D. 
1BC9F. 
1E800. 
1E8C7. 
1Е800. 
1ЕЕ00. 
IEEOS- 
TEE2Z:- 


1EE24 
TEE27 


TEE29- 
1EE34. 


1EE39 
1EE3B 
1EE42 
1EE47 
1EE49 
1EE4B 


1EE4D.. 
TEES 


1EE54 
1EE57 
1EE59 
1EE5B 
ТЕЕ5р 
1EESF 


1EE61.. 


1EE64 


1EE67.. 
ТЕЕ6 2 
1ЕЕ74.. 
ПЕЕЛ 


1EE7E 


EES Ors 
TEESB T 
1EEA1.. 
1ЕЕА5.. 
1ЕЕАВ.. 
TEEEOPT 
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. 16AED; 
. 16AF4; 


.16B36; 
.16B3F ; 
.16B43; 
.16B45; 
.16B59; 
.16B61; 
.16B77; 
.16B8F ; 
.16F44; 
.16F7E; 
.16F9F ; 
.1BC6A ; 
186767 
.1BC88; 
.1BC99; 


.1ВС9Е; 
.1ВСАЗ; 
.1Е8С4; 
. 1E8CF ; 
. 1E8D6; 
. 1EE@3 ; 
ЕЕ ЕЗ 
SIEE22$, 


.1EE32: 
.TEE37: 


PVALID 

PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
PVALID 

PVALID 

PVALID 

PVALID 

PVALID 

PVALID 

PVALID 

PVALID 

PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
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BASSA VAH LETTER ENNI..BASSA VAH LETTER I 
BASSA VAH COMBINING HIGH TONE..BASSA VAH COMB 


BASSA VAH FULL STOP 
PAHAWH HMONG 
PAHAWH HMONG 
PAHAWH HMONG 
PAHAWH HMONG 
PAHAWH HMONG 
PAHAWH HMONG 
PAHAWH HMONG 
PAHAWH HMONG 


MIAO LETTER PA. .MIAO LETTER HHA 
MIAO LETTER NASALIZATION. .MIAO VOWEL SIGN NG 
MIAO TONE RIGHT..MIAO LETTER REFORMED TONE-8 


DUPLOYAN 
DUPLOYAN 
DUPLOYAN 
DUPLOYAN 
DUPLOYAN 
DUPLOYAN 
DUPLOYAN 


VOWEL KEEB..PAHAWH HMONG MARK CI 
SIGN VOS THOM. .PAHAWH HMONG SIGN 
SIGN VOS SEEV..PAHAWH HMONG SIGN 
SIGN XAUS..PAHAWH HMONG SIGN CIM 
DIGIT ZERO..PAHAWH HMONG DIGIT N 
NUMBER TENS. .PAHAWH HMONG NUMBER 
SIGN VOS LUB..PAHAWH HMONG SIGN 

CLAN SIGN TSHEEJ..PAHAWH HMONG C 


LETTER H..DUPLOYAN LETTER VOCALIC M 

AFFIX LEFT HORIZONTAL SECANT..DUPLOY 
AFFIX HIGH ACUTE..DUPLOYAN AFFIX HIG 
AFFIX LOW ACUTE..DUPLOYAN AFFIX LOW 

SIGN O WITH CROSS 
THICK LETTER SELECTOR. .DUPLOYAN DOUB 
PUNCTUATION CHINOOK FULL STOP..SHORT 


MENDE KIKAKUI SYLLABLE M001 KI..MENDE KIKAKUI 
MENDE KIKAKUI DIGIT ONE..MENDE KIKAKUI DIGIT 
MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE K 


ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 
ARABIC 


MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
MATHEMATICAL 
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ALEF..ARABIC MATHEMATICAL 
WAW..ARABIC MATHEMATICAL 
INITIAL BEH..ARABIC MATHE 
INITIAL HEH 

INITIAL HAH 

INITIAL YEH..ARABIC MATHE 
INITIAL SHEEN..ARABIC MAT 
INITIAL DAD 

INITIAL GHAIN 

TAILED JEEM 

TAILED HAH 

TAILED YEH 

TAILED LAM 

TAILED NOON..ARABIC MATHE 
TAILED SAD..ARABIC MATHEM 
TAILED SHEEN 

TAILED KHAH 

TAILED DAD 

TAILED GHAIN 

TAILED DOTLESS NOON 
TAILED DOTLESS QAF 
STRETCHED BEH..ARABIC MAT 
STRETCHED HEH 

STRETCHED HAH..ARABIC MAT 
STRETCHED MEEM..ARABIC MA 
STRETCHED SHEEN..ARABIC M 
STRETCHED DAD..ARABIC MAT 
STRETCHED DOTLESS FEH 
LOOPED ALEF..ARABIC MATHE 


LOOPED LAM..ARABIC 
DOUBLE-STRUCK BEH.. 
DOUBLE-STRUCK WAW.. 
DOUBLE-STRUCK LAM.. 
OPERATOR MEEM WITH 


MATHEM 
ARABIC 
ARABIC 
ARABIC 
HAH WI 
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1FOBF 


1Ғ0Е0. 
1F10B. 
1F16A. 
T S2 


1F336 
1F37D 


1F394. 


1F3C5 


1F3CB. 
1F3D4. 
ЕЗЕТ 


1F43F 
1F441 
1F4F8 


1F4FD.. 
TESSEN: 
1F568.. 
ШЕВ: 
1F5A5.. 


1F600 
1F611 
1F615 
1F617 
1F619 
1F61B 
1F61F 


1F626.. 


1F62C 


ТЕ62Е. 


12634 


1F641.. 
1F650.. 
1F6C6.. 
1F6E0.. 
1F6F0.. 
ПЕЛ8ӨС 
1Ғ800.. 
ЛЕ8Л0Ө-- 
1F850.. 
1F860.. 
1F890.. 


Appendix B. 


Fáltstróm 


.TF0F5 
.1Ғ10С 
.1Ғ16В 
.1ЕЗ2С 


.TF39F 


.ТЕЗСЕ 


. 1F3DF 
. 1F3F7 


DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
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PLAYING CARD RED JOKER 

PLAYING CARD FOOL..PLAYING CARD TRUMP-21 
DINGBAT CIRCLED SANS-SERIF DIGIT ZERO. .DINGBA 
RAISED MC SIGN..RAISED MD SIGN 

THERMOMETER. .WIND BLOWING FACE 

HOT PEPPER 

FORK AND KNIFE WITH PLATE 

HEART WITH TIP ON THE LEFT..ADMISSION TICKETS 
SPORTS MEDAL 

WEIGHT LIFTER..RACING CAR 

SNOW CAPPED MOUNTAIN. .STADIUM 

WHITE PENNANT. .LABEL 

CHIPMUNK 

EYE 

CAMERA WITH FLASH 

FILM PROJECTOR. .PORTABLE STEREO 

LOWER RIGHT SHADOWED WHITE CIRCLE..DOVE OF PE 
RIGHT SPEAKER. .JOYSTICK 

LEFT HAND TELEPHONE RECEIVER..BLACK DOWN POIN 
DESKTOP COMPUTER. .WORLD MAP 

GRINNING FACE 

EXPRESSIONLESS FACE 

CONFUSED FACE 

KISSING FACE 

KISSING FACE WITH SMILING EYES 

FACE WITH STUCK-OUT TONGUE 

WORRIED FACE 

FROWNING FACE WITH OPEN MOUTH. .ANGUISHED FACE 
GRIMACING FACE 

FACE WITH OPEN MOUTH. .HUSHED FACE 

SLEEPING FACE 

SLIGHTLY FROWNING FACE..SLIGHTLY SMILING FACE 
NORTH WEST POINTING LEAF..REVERSE CHECKER BOA 
TRIANGLE WITH ROUNDED CORNERS. .BED 

HAMMER AND WRENCH. .AIRPLANE ARRIVING 
SATELLITE. .PASSENGER SHIP 

BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE. 
LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD 
LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWH 
LEFTWARDS SANS-SERIF ARROW. .UP DOWN SANS-SERI 
WIDE-HEADED LEFTWARDS LIGHT BARB ARROW. .WIDE- 
LEFTWARDS TRIANGLE ARROWHEAD. .WHITE ARROW SHA 


Changes from Unicode 7.0.0 to Unicode 8.0.0 


Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED. 
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08B3. 
Q8E3 
gAF9 
gC5A 
@D5F 
13F5 
T3E87 
20BE 
218A. 
2BEC. 
9FCD. 
A69E 
A78F 
A7B2. 
A7B5 
A7B6 
A7B7 
A8FC 
A8FD 


AB60.. 
AB70.. 
FE2E E 
MOSEO: 
108F4.. 
108FB.. 
109BC.. 
109С0.. 
10902.. 
10C80.. 
10СС0.. 
10CFA.. 


111C9 


111CA.. 


111DB 
111DC 


111DD.. 
1128.07 


11288 


1128A.. 
TIAS 2 
ПӘ Fu 


112A9 
11300 
11350 


115CA.. 
115D8.. 
1315720072 
ШАН 
ІШТА 22. 
ПИЗА 


12399 


12480.. 
14400.. 
1D1DE.. 
1D800.. 
1DA00.. 
1DA37.. 


Fáltstróm 


.08В4 


.13FD ` 
.218B ` 
ВЕЕ: 
.9FD5 : 


.А7В4 


PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
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ARABIC LETTER AIN WITH THREE DOTS BELOW. .ARAB 
ARABIC TURNED DAMMA BELOW 

GUJARATI LETTER ZHA 

TELUGU LETTER RRRA 

MALAYALAM LETTER ARCHAIC II 

CHEROKEE LETTER MV 

CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETT 
LARI SIGN 

TURNED DIGIT TWO..TURNED DIGIT THREE 
LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARRO 
«CJK Ideograph>..<CJK Ideograph> 

COMBINING CYRILLIC LETTER EF 

LATIN LETTER SINOLOGICAL DOT 

LATIN CAPITAL LETTER J WITH CROSSED-TAIL..LAT 
LATIN SMALL LETTER BETA 

LATIN CAPITAL LETTER OMEGA 

LATIN SMALL LETTER OMEGA 

DEVANAGARI SIGN SIDDHAM 

DEVANAGARI JAIN OM 

LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LET 
CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTE 
COMBINING CYRILLIC TITLO LEFT HALF. .COMBINING 
HATRAN LETTER ALEPH. .HATRAN LETTER (ОРН 
HATRAN LETTER SHIN..HATRAN LETTER TAW 

HATRAN NUMBER ONE..HATRAN NUMBER ONE HUNDRED 
MEROITIC CURSIVE FRACTION ELEVEN TWELFTHS. .ME 
MEROITIC CURSIVE NUMBER ONE..MEROITIC CURSIVE 
MEROITIC CURSIVE NUMBER ONE HUNDRED. .MEROITIC 
OLD HUNGARIAN CAPITAL LETTER A..OLD HUNGARIAN 
OLD HUNGARIAN SMALL LETTER A..OLD HUNGARIAN S 
OLD HUNGARIAN NUMBER ONE..OLD HUNGARIAN NUMBE 
SHARADA SANDHI MARK 

SHARADA SIGN NUKTA..SHARADA EXTRA SHORT VOWEL 
SHARADA SIGN SIDDHAM 

SHARADA HEADSTROKE 

SHARADA CONTINUATION SIGN..SHARADA SECTION MA 
MULTANI LETTER A..MULTANI LETTER GA 

MULTANI LETTER GHA 

MULTANI LETTER CA..MULTANI LETTER JJA 

MULTANI LETTER NYA..MULTANI LETTER BA 

MULTANI LETTER BHA..MULTANI LETTER RHA 
MULTANI SECTION MARK 

GRANTHA SIGN COMBINING ANUSVARA ABOVE 

GRANTHA OM 

SIDDHAM SECTION MARK WITH TRIDENT AND U-SHAPE 
SIDDHAM LETTER THREE-CIRCLE ALTERNATE I..SIDD 
AHOM LETTER KA..AHOM LETTER JHA 

AHOM CONSONANT SIGN MEDIAL LA..AHOM SIGN KILL 
AHOM DIGIT ZERO..AHOM DIGIT NINE 

AHOM NUMBER TEN..AHOM SYMBOL VI 

CUNEIFORM SIGN U U 

CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM S 
ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLY 
MUSICAL SYMBOL KIEVAN C CLEF..MUSICAL SYMBOL 
SIGNWRITING HAND-FIST INDEX..SIGNWRITING HEAD 
SIGNWRITING HEAD RIM..SIGNWRITING AIR SUCKING 
SIGNWRITING AIR BLOW SMALL ROTATIONS..SIGNWRI 
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1DA3B. 
1DA6D. 


1DA75 


1DA76. 


1DA84 


1DA85. 
1DA9B. 
1DAA1. 
1F32D. 
ТЕЗЕ? 
1F3CF. 
1F3F8. 


1F4FF 


1F54B. 
1F643. 


1F6D0 


1F910. 
1F980. 


1Ғ9С0 


2В820. 


.1DA6C; 
.1рА74; 


.1DA83: 


. 1DA8B; 
. 1DA9F ; 
. 1DAAF ; 
SIESZ ES 
PESZE; 
ESDS; 
PESEE, 


.1F54F; 
1Ғ644: 


.1F918; 
| 1F984: 


.2CEA1: 


PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
PVALID 

DISALLOWED 
PVALID 

PVALID 

DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
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SIGNWRITING 
SIGNWRITING 
SIGNWRITING 
SIGNWRITING 
SIGNWRITING 
SIGNWRITING 


MOUTH CLOSED NEUTRAL. .SIGNWRITING 
SHOULDER HIP SPINE..SIGNWRITING T 
UPPER BODY TILTING FROM HIP JOINT 
LIMB COMBINATION. .SIGNWRITING LOC 
LOCATION HEAD NECK 

LOCATION TORSO. .SIGNWRITING PAREN 
SIGNWRITING FILL MODIFIER-2..SIGNWRITING FILL 
SIGNWRITING ROTATION MODIFIER-2..SIGNWRITING 
HOT DOG. .BURRITO 

BOTTLE WITH POPPING CORK. .POPCORN 

CRICKET BAT AND BALL..TABLE TENNIS PADDLE AND 
BADMINTON RACQUET AND SHUTTLECOCK..EMOJI MODI 
PRAYER BEADS 

КААВА. .BOWL OF HYGIEIA 

UPSIDE-DOWN FACE..FACE WITH ROLLING EYES 
PLACE OF WORSHIP 

ZIPPER-MOUTH FACE..SIGN OF THE HORNS 

CRAB. .UNICORN FACE 

CHEESE WEDGE 

<CJK Ideograph Extension E>..<CJK Ideograph E 


Appendix C. Changes from Unicode 8.0.0 to Unicode 9.0.0 


Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED. 


Fáltstróm 
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08B6. 
08D4. 
08E2 
0C80 
@D4F 
0054. 
0058. 
0076. 
1C80. 
1DFB 
23FB. 
2E43. 
A7AE 
A8C5 


1018D. 
104B0. 
104D8. 


1123E 


11400. 
1144B. 
11450. 


1145B 
1145D 


11660. 
11C00. 
11СӨА. 
11C38. 
11C41. 
11058. 
11C5A. 
11078. 
4102025 
11092. 
11CA9. 


16FE0 


17000. 
18800. 
1Е000. 
1Е008. 
1E01B. 
1Е023. 
1Е026. 
1Е900. 
1E9227 
1E9507 
TE95E- 
1F19B. 


1F23B 
1F57A 
1F5A4 


1F6D1.. 
1F6F4.. 
EOS 
11920 


1F930 


11Е933:- 
1Ғ940.. 


Fáltstróm 


. 08BD 
.08E1 


.0D56 
.0D5E 
.0D78 
.1C88 


ЗБЕ 
.2Е44 


.1018E: 
10403: 
.104FB: 


211444; 
"1144F: 
11459: 


.1166C; 
.11C08; 
.11C36; 
.11C40; 
.11C45; 
.11С59; 
.11C6C; 
EST 
SING SES, 
.11СА7; 
.11CB6; 


S197E65 
.18AF2; 
.1Е006; 
.1Е018; 
.1E021; 
.1Е024; 
.1E02A; 
11 Е92Л1> 
.1Е94А; 
.1Е959; 
SIES5ES 
. TF1AC; 


PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 


HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH HAH 
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ARABIC LETTER ВЕН WITH SMALL MEEM ABOVE. .ARAB 
ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL H 
ARABIC DISPUTED END OF AYAH 

KANNADA SIGN SPACING CANDRABINDU 

MALAYALAM SIGN PARA 

MALAYALAM LETTER CHILLU M..MALAYALAM LETTER C 
MALAYALAM FRACTION ONE ONE-HUNDRED-AND-SIXTIE 
MALAYALAM FRACTION ONE SIXTEENTH. .MALAYALAM F 
CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SM 
COMBINING DELETION MARK 

POWER SYMBOL..POWER SLEEP SYMBOL 

DASH WITH LEFT UPTURN. .DOUBLE SUSPENSION MARK 
LATIN CAPITAL LETTER SMALL CAPITAL I 
SAURASHTRA SIGN CANDRABINDU 

GREEK INDICTION SIGN. .NOMISMA SIGN 

OSAGE CAPITAL LETTER A..OSAGE CAPITAL LETTER 
OSAGE SMALL LETTER A..OSAGE SMALL LETTER ZHA 
KHOJKI SIGN SUKUN 

NEWA LETTER A..NEWA SIDDHI 

NEWA DANDA..NEWA ABBREVIATION SIGN 

NEWA DIGIT ZERO..NEWA DIGIT NINE 

NEWA PLACEHOLDER MARK 

NEWA INSERTION SIGN 

MONGOLIAN BIRGA WITH ORNAMENT..MONGOLIAN TURN 
BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC 
BHAIKSUKI LETTER E..BHAIKSUKI VOWEL SIGN VOCA 
BHAIKSUKI VOWEL SIGN E..BHAIKSUKI SIGN AVAGRA 
BHAIKSUKI DANDA..BHAIKSUKI GAP FILLER-2 
BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE 
BHAIKSUKI NUMBER ONE..BHAIKSUKI HUNDREDS UNIT 
MARCHEN HEAD MARK..MARCHEN MARK SHAD 

MARCHEN LETTER KA..MARCHEN LETTER A 

MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINE 
MARCHEN SUBJOINED LETTER YA..MARCHEN SIGN CAN 
TANGUT ITERATION MARK 

«Tangut Ideograph>..<Tangut Ideograph> 

TANGUT COMPONENT-001..TANGUT COMPONENT-755 
COMBINING GLAGOLITIC LETTER AZU..COMBINING GL 
COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING 
COMBINING GLAGOLITIC LETTER SHTA..COMBINING G 
COMBINING GLAGOLITIC LETTER YU..COMBINING GLA 
COMBINING GLAGOLITIC LETTER YO..COMBINING GLA 
ADLAM CAPITAL LETTER ALIF..ADLAM CAPITAL LETT 
ADLAM SMALL LETTER ALIF..ADLAM NUKTA 

ADLAM DIGIT ZERO..ADLAM DIGIT NINE 

ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIAL 
SQUARED THREE D..SQUARED VOD 

SQUARED CJK UNIFIED IDEOGRAPH-914D 

MAN DANCING 

BLACK HEART 

OCTAGONAL SIGN..SHOPPING TROLLEY 
SCOOTER. . CANOE 

CALL ME HAND..HAND WITH INDEX AND MIDDLE FING 
FACE WITH COWBOY HAT..SNEEZING FACE 

PREGNANT WOMAN 

SELFIE..HANDBALL 

WILTED FLOWER..MARTIAL ARTS UNIFORM 
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1Ғ950. 
1Ғ985. 


SIES5E; 
.1Ғ991; 
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DISALLOWED # CROISSANT..PANCAKES 
DISALLOWED # EAGLE. . $0010 


Appendix D. Changes from Unicode 9.0.0 to Unicode 10.0.0 


Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED. 


0860.. 


g9FC 
09Ғр 
OAFA. 
өрөө 
@D3B. 
1CF7 
1DF6. 
20BF 
23FF 
2BD2 
2bE45. 
312E 
9FD6. 


11A47 


11А50. 
11А86. 
11А9А. 
11A9E. 
11D00. 
11D08. 
11р0В. 


11D3A 


11D3C. 
11D3F. 
11050. 


16ҒЕ1 


1В002. 
1В170. 
1Ғ260. 
1F6D3. 
1F6F7. 
1F900. 


1F91F 


1F928. 
1F931. 


1F94C 


1F95F. 
1F992. 
1F9D0. 
2CEBO. 


Fáltstróm 


.OAFF 
. 0D3C 


.1DF9 


.2E49 


PORE Aas 
1032D. 
11400. 
11A3F. 


086A 


.1032F ; 
.1ТАЗЕ; 
.11А46; 


.11А83; 
.11А99; 
.11А9С; 
.11АА2; 
11006; 
11009; 
11036; 


.11D3D: 
“11047: 
711059: 


ВАТЕ 
.1В2ЕВ; 
STE2/6655 
. 1F6D4; 
“ТЕбЕЗ» 
.1Ғ90В; 


.1F92F; 
.1F932: 


. 1F96B; 
318997 
. 1F9E6; 
.2ЕВЕ0; 


PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 


HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH HHH HH SHE 


SYRIAC LETTER MALAYALAM NGA..SYRIAC LETTER MA 
BENGALI LETTER VEDIC ANUSVARA 

BENGALI ABBREVIATION SIGN 

GUJARATI SIGN SUKUN..GUJARATI SIGN TWO-CIRCLE 
MALAYALAM SIGN COMBINING ANUSVARA ABOVE 
MALAYALAM SIGN VERTICAL BAR VIRAMA. . МАГ АҮАГАМ 
VEDIC SIGN ATIKRAMA 

COMBINING KAVYKA ABOVE RIGHT..COMBINING WIDE 
BITCOIN SIGN 

OBSERVER EYE SYMBOL 

GROUP MARK 

INVERTED LOW KAVYKA..DOUBLE STACKED COMMA 
BOPOMOFO LETTER O WITH DOT ABOVE 

«CJK Ideograph>..<CJK Ideograph> 

OLD ITALIC LETTER YE..OLD ITALIC LETTER SOUTH 
ZANABAZAR SQUARE LETTER A..ZANABAZAR SQUARE C 
ZANABAZAR SQUARE INITIAL HEAD MARK. .ZANABAZAR 
ZANABAZAR SQUARE SUBJOINER 

SOYOMBO LETTER A..SOYOMBO LETTER KSSA 

SOYOMBO CLUSTER-INITIAL LETTER RA..SOYOMBO SU 
SOYOMBO MARK TSHEG..SOYOMBO MARK DOUBLE SHAD 
SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPL 
MASARAM GONDI LETTER A..MASARAM GONDI LETTER 
MASARAM GONDI LETTER AI..MASARAM GONDI LETTER 
MASARAM GONDI LETTER AU..MASARAM GONDI VOWEL 
MASARAM GONDI VOWEL SIGN E 

MASARAM GONDI VOWEL SIGN AI..MASARAM GONDI VO 
MASARAM GONDI VOWEL SIGN AU..MASARAM GONDI RA 
MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT 
NUSHU ITERATION MARK 

HENTAIGANA LETTER A-1..HENTAIGANA LETTER N-MU 
NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB 
ROUNDED SYMBOL FOR FU..ROUNDED SYMBOL FOR CAI 
STUPA. . PAGODA 

SLED..FLYING SAUCER 

CIRCLED CROSS FORMEE WITH FOUR DOTS. .DOWNWARD 
I LOVE YOU HAND SIGN 

FACE WITH ONE EYEBROW RAISED..SHOCKED FACE WI 
BREAST-FEEDING. .PALMS UP TOGETHER 

CURLING STONE 

DUMPLING. .CANNED FOOD 

GIRAFFE FACE. .CRICKET 

FACE WITH MONOCLE..SOCKS 

<CJK Ideograph Extension F>..<CJK Ideograph E 
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Appendix E. Changes from Unicode 10.0.0 to Unicode 11.0.0 


Changes from derived property value DISALLOWED to PVALID. 


11109 ; PVALID # SHARADA SANDHI MARK 


Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED. 
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0560 
0588 
OSER 
07FD 
g ZEE. 
08D3 
09ҒЕ 
0A76 
0C04 
0C84 
1878 


1C90.. 
1CBD.. 
2BBA.. 
2BD3.. 
2BF0.. 
2E4A.. 


312F 


9FEB.. 


A7AF 
A7B8 
A7B9 


A8FE.. 
10A34.. 


10A48 


10D00.. 
10D30.. 
10F00.. 
TOE: D s: 


10F27 


10F30.. 
TOE Silas 


110CD 


11144.. 


1133B 
1145E 
1171A 


11800.. 


1183B 
11A9D 


11D60.. 
11D67.. 
11D6A.. 
11D90.. 
11D93.. 
11DA0.. 
THESE 
ПИЕ Е. 
16Е40.. 
16Е60.. 
16Е80.. 
187ED.. 
ПР ЕӨСС 
ПЗ. 
ТЕСТЕ. 


ТЕТ2Е 
1F6F9 


Fáltstróm 


.07FF 


PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
PVALID 
DISALLOWED 
DISALLOWED 
PVALID 
DISALLOWED 
PVALID 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
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ARMENIAN SMALL LETTER TURNED AYB 
ARMENIAN SMALL LETTER YI WITH STROKE 
HEBREW YOD TRIANGLE 

NKO DANTAYALAN 

NKO DOROME SIGN..NKO TAMAN SIGN 
ARABIC SMALL LOW WAW 

BENGALI SANDHI MARK 

GURMUKHI ABBREVIATION SIGN 

TELUGU SIGN COMBINING ANUSVARA ABOVE 
KANNADA SIGN SIDDHAM 
MONGOLIAN LETTER CHA WITH 
GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIAN 
GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGIA 
OVERLAPPING WHITE SQUARES..OVERLAPPING BLACK 
PLUTO FORM TWO..STAR WITH RIGHT HALF BLACK 
ERIS FORM ONE..REVERSED RIGHT ANGLE 

DOTTED SOLIDUS..PUNCTUS ELEVATUS MARK 
BOPOMOFO LETTER NN 

«CJK Ideograph>..<CJK Ideograph> 

LATIN LETTER SMALL CAPITAL Q 

LATIN CAPITAL LETTER U WITH STROKE 

LATIN SMALL LETTER U WITH STROKE 

DEVANAGARI LETTER AY..DEVANAGARI VOWEL SIGN A 
KHAROSHTHI LETTER TTTA..KHAROSHTHI LETTER VHA 
KHAROSHTHI FRACTION ONE HALF 

HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA SIG 
HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA D 
OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER 
OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION 
OLD SOGDIAN LIGATURE AYIN-DALETH 

SOGDIAN LETTER ALEPH..SOGDIAN COMBINING STROK 
SOGDIAN NUMBER ONE..SOGDIAN PUNCTUATION HALF 
KAITHI NUMBER SIGN ABOVE 

CHAKMA LETTER LHAA..CHAKMA VOWEL SIGN EI 
COMBINING BINDU BELOW 

NEWA SANDHI MARK 

AHOM LETTER ALTERNATE BA 

DOGRA LETTER A..DOGRA SIGN NUKTA 

DOGRA ABBREVIATION SIGN 

SOYOMBO MARK PLUTA 

GUNJALA GONDI LETTER A..GUNJALA GONDI LETTER 
GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER 
GUNJALA GONDI LETTER 00..GUNJALA GONDI VOWEL 
GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VO 
GUNJALA GONDI VOWEL SIGN 00..GUNJALA GONDI OM 
GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT 
MAKASAR LETTER KA..MAKASAR VOWEL SIGN O 
MAKASAR PASSIMBANG..MAKASAR END OF SECTION 
MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN CAP 
MEDEFAIDRIN SMALL LETTER M..MEDEFAIDRIN SMALL 
MEDEFAIDRIN DIGIT ZERO..MEDEFAIDRIN EXCLAMATI 
«Tangut Ideograph>..<Tangut Ideograph> 

MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN 
IDEOGRAPHIC TALLY MARK ONE..TALLY MARK FIVE 
INDIC SIYAQ NUMBER ONE..INDIC SIYAQ ALTERNATE 
COPYLEFT SYMBOL 

SKATEBOARD 


TWO DOTS 
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1F7D5. 
1F94D. 
1F96C. 
1F973. 


1F97A 


1F97C. 
1F998. 
1Ғ9В0. 
1Ғ9С1. 
EGET. 
1FA60. 


.1F7D8; 
.TF94F ; 
129705 
.1Е976; 


.1F97F; 
.1F9A2; 
.1F9B9; 
.1F9C2; 
.1F9FF; 
.1FA6D; 


DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
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CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE 
LACROSSE STICK AND BALL..FLYING DISC 

LEAFY GREEN..SMILING FACE WITH SMILING EYES A 
FACE WITH PARTY HORN AND PARTY НАТ. .FREEZING 
FACE WITH PLEADING EYES 

LAB COAT..FLAT SHOE 

KANGAROO. . SWAN 

EMOJI COMPONENT RED HAIR. .SUPERVILLAIN 
CUPCAKE..SALT SHAKER 

RED GIFT ENVELOPE. .NAZAR AMULET 

XIANGQI RED GENERAL. .XIANGQI BLACK SOLDIER 


Appendix F. Changes from Unicode 11.0.0 to Unicode 12.0.0 


Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED. 
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RFC 9233 IDNA2008 and Unicode 12 March 2022 
0C77 ; DISALLOWED # TELUGU SIGN SIDDHAM 
0Е86 ; PVALID # LAO LETTER PALI GHA 
0Е89 ; PVALID # LAO LETTER PALI CHA 
@E8C ; PVALID # LAO LETTER PALI JHA 
OE8E..0E93 ; PVALID # LAO LETTER PALI NYA..LAO LETTER PALI NNA 
0Е98 ; PVALID # LAO LETTER PALI DHA 
дЕА0 ; PVALID # LAO LETTER PALI BHA 
OEA8..0EA9 ; PVALID # LAO LETTER SANSKRIT SHA..LAO LETTER SANSKRIT 
@EAC ; PVALID # LAO LETTER PALI LLA 
@EBA ; PVALID # LAO SIGN PALI VIRAMA 
1CFA ; PVALID # VEDIC SIGN DOUBLE ANUSVARA ANTARGOMUKHA 
2BC9 ; DISALLOWED # NEPTUNE FORM TWO 
2BFF ; DISALLOWED # HELLSCHREIBER PAUSE SYMBOL 
2E4F ; DISALLOWED # CORNISH VERSE DIVIDER 
A7BA ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL A 
A7BB ; PVALID # LATIN SMALL LETTER GLOTTAL A 
A7BC ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL I 
A7BD ; PVALID # LATIN SMALL LETTER GLOTTAL I 
A7BE ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL U 
A7BF ; PVALID # LATIN SMALL LETTER GLOTTAL U 
A7C2 ; DISALLOWED # LATIN CAPITAL LETTER ANGLICANA W 
A7C3 ; PVALID # LATIN SMALL LETTER ANGLICANA W 
A7C4..A7C6 ; DISALLOWED # LATIN CAPITAL LETTER C WITH PALATAL HOOK. .LAT 
AB66..AB67 ; PVALID # LATIN SMALL LETTER DZ DIGRAPH WITH RETROFLEX 
10FE0..10FF6; PVALID # ELYMAIC LETTER ALEPH..ELYMAIC LIGATURE ZAYIN- 
1145F ; PVALID 3 NEWA LETTER VEDIC ANUSVARA 
116B8 PVALID 3 TAKRI LETTER ARCHAIC KHA 
119А0..119А7; PVALID 3 NANDINAGARI LETTER A..NANDINAGARI LETTER VOCA 
119AA..119D7; PVALID 3 NANDINAGARI LETTER E..NANDINAGARI VOWEL SIGN 
119DA..119E1; PVALID # NANDINAGARI VOWEL SIGN E..NANDINAGARI SIGN AV 
119E2 ; DISALLOWED # NANDINAGARI SIGN SIDDHAM 
119E3..119E4; PVALID 3 NANDINAGARI HEADSTROKE..NANDINAGARI VOWEL SIG 
11A84..11A85; PVALID à SOYOMBO SIGN JIHVAMULIYA..SOYOMBO SIGN UPADHM 
11FC@..11FF1; DISALLOWED # TAMIL FRACTION ONE THREE-HUNDRED-AND- TWENTIET 
11FFF ; DISALLOWED £ TAMIL PUNCTUATION END OF TEXT 
13430..13438; DISALLOWED £ EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN 
16F45..16F4A; PVALID # MIAO LETTER BRI..MIAO LETTER RTE 
16F4F ; PVALID # MIAO SIGN CONSONANT MODIFIER BAR 
16F7F..16F87; PVALID # MIAO VOWEL SIGN UOG..MIAO VOWEL SIGN UI 
16FE2 ; DISALLOWED # OLD CHINESE HOOK MARK 
16FE3 ; PVALID # OLD CHINESE ITERATION MARK 
187F2..187F7; PVALID # <Tangut Ideograph>..<Tangut Ideograph> 
1B150..1B152; PVALID # HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMA 
1B164..1B167; PVALID # KATAKANA LETTER SMALL WI..KATAKANA LETTER SMA 
1Е100..1Е12С; PVALID # NYIAKENG PUACHUE HMONG LETTER MA. .NYIAKENG PU 
1Е130..1Е130; PVALID # NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACH 
1Е140..1Е149; PVALID # NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG P 
1E14E ; PVALID # NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ 
1E14F ; DISALLOWED # NYIAKENG PUACHUE HMONG CIRCLED CA 
1E2C@..1E2F9; PVALID # WANCHO LETTER AA..WANCHO DIGIT NINE 
1E2FF ; DISALLOWED # WANCHO NGUN SIGN 
1E94B ; PVALID # ADLAM NASALIZATION MARK 
1ED@1..1ED3D; DISALLOWED # OTTOMAN SIYAQ NUMBER ONE. .OTTOMAN SIYAQ FRACT 
1F16C ; DISALLOWED # RAISED MR SIGN 
1F6D5 ; DISALLOWED # HINDU TEMPLE 
1F6FA ; DISALLOWED # AUTO RICKSHAW 
1F7E@..1F7EB; DISALLOWED # LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 
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1F90D. 


1F93F 
1F971 
1F97B 


1F9A5. 
1F9AE. 
1F9BA. 
1F9C3. 
1F9CD. 
1ҒА00. 
1ҒА70. 
1FA78. 
1ҒА80. 
1ҒА90. 


.TF90F ; 


.1F9AA; 
.1F9AF; 
ЕВЕ 
. 1F9CA; 
SEO CE 
.1FA53; 
.1FA73; 
. 1FA7A; 
.1ҒА82; 
.1ҒА95; 


DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
DISALLOWED 
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WHITE HEART..PINCHING HAND 
DIVING MASK 

YAWNING FACE 

SARI 

SLOTH- -OYSTER 

GUIDE DOG..PROBING CANE 
SAFETY VEST..MECHANICAL LEG 
BEVERAGE BOX..ICE CUBE 
STANDING PERSON..DEAF PERSON 
NEUTRAL CHESS KING..BLACK CHESS KNIGHT-BISHOP 
BALLET ЅНОЕЅ. . SHORTS 

DROP OF BLOOD. .STETHOSCOPE 
YO-YO. .PARACHUTE 

RINGED PLANET. .BANJO 
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